Predicting the Category and Attributes of Mental Pictures Using Deep Gaze Pooling

Authors

  • Hosnieh Sattar
  • Andreas Bulling
  • Mario Fritz
Abstract

Previous work has focused on predicting visual search targets from human fixations. In the real world, however, a specific target is often not known, e.g. when searching for a present for a friend. In this work we instead study the problem of predicting the mental picture, i.e. only an abstract idea rather than a specific target. This task is significantly more challenging, given that mental pictures of the same target category can vary widely depending on personal biases, and that characteristic target attributes often cannot be verbalised explicitly. We propose to use gaze information as an implicit source of information about users’ mental pictures and present a novel gaze pooling layer that seamlessly integrates semantic and localized fixation information into a deep image representation. We show that we can robustly predict both the mental picture’s category and its attributes on a novel dataset containing fixation data of 14 users searching for targets on a subset of the DeepFashion dataset. Our results have important implications for future search interfaces and suggest deep gaze pooling as a general-purpose approach for gaze-supported computer vision systems.
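
The abstract does not spell out how the gaze pooling layer works, so the following is only a rough sketch of one plausible reading: fixations are turned into a spatial density map, that map re-weights a convolutional feature map, and the result is pooled over space. The function names `fixation_density_map` and `gaze_pool`, the Gaussian spread `sigma`, and the use of plain NumPy arrays in place of a real network are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def fixation_density_map(fixations, height, width, sigma=1.0):
    """Hypothetical helper: place an isotropic Gaussian at each (row, col)
    fixation and normalize the result so that it sums to 1."""
    ys, xs = np.mgrid[0:height, 0:width]
    density = np.zeros((height, width), dtype=np.float64)
    for fy, fx in fixations:
        density += np.exp(-((ys - fy) ** 2 + (xs - fx) ** 2) / (2.0 * sigma ** 2))
    total = density.sum()
    if total == 0:
        # No fixations recorded: fall back to uniform weights (an assumption).
        return np.full((height, width), 1.0 / (height * width))
    return density / total


def gaze_pool(feature_maps, density):
    """Hypothetical pooling step: weight every channel of a (C, H, W) feature
    map by the (H, W) density map, then sum over the spatial dimensions."""
    weighted = feature_maps * density[None, :, :]
    return weighted.sum(axis=(1, 2))


# Stand-in for a CNN feature map: 3 channels on a 4x4 spatial grid.
features = np.random.default_rng(0).random((3, 4, 4))
density = fixation_density_map([(1, 2), (3, 0)], height=4, width=4)
pooled = gaze_pool(features, density)  # one gaze-weighted value per channel
```

Because the density map is normalized, the pooled vector is a fixation-weighted average of the features, so regions the user looked at contribute more than regions that were ignored.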

Related resources

Early detection of MS in fMRI images using deep learning techniques

Introduction & Objective: MS is a disease of the central nervous system in which the body makes a defensive attack on its tissues. The disease can affect the brain and spinal cord, causing a wide range of potential symptoms, including balance, movement and vision problems. MRI and fMRI images are a very important tool in the diagnosis and treatment of MS. The aim of this study was to provide...

Predicting of the Quality Attributes of Orange Fruit Using Hyperspectral Images

Background: Hyperspectral image analysis is a fast and non-destructive technique that is being used to measure quality attributes of food products. This research investigated the feasibility of predicting internal quality attributes, such as Total Soluble Solids (TSS), pH, Titratable Acidity (TA), and maturity index (TSS/TA); and external quality attributes such as color components (L*, a*, b*)...

The Female gaze in proportion to pictorial elements in "A parrot with fruit and a portrait of a girl"

Qajar painting influenced Iran’s painting with a new kind of illustrations originating from the past traditions. Art and cultural politics of Fath Ali Shah performs an obvious role amongst the influential agents and historical events in the era of Qajar paintings for a presentation of the concepts of power in the social, political and cultural arena. Fath Ali Shah’s patronage of the art alters ...

Bag of Attributes for Video Event Retrieval

In this paper, we present the Bag-of-Attributes (BoA) model for video representation aiming at video event retrieval. The BoA model is based on a semantic feature space for representing videos, resulting in high-level video feature vectors. For creating a semantic space, i.e., the attribute space, we can train a classifier using a labeled image dataset, obtaining a classification model that can...

Predicting the survival and dropout of addiction treatment interventions based on sensation seeking and impulsivity

The present study was conducted with the aim of predicting the survival and dropout of addiction treatment based on sensation seeking and impulsivity. The present study was descriptive-correlational. The statistical population of this study was all addicts in Ardabil city who came to one of the centers for addiction treatment, and 349 of them were selected based on Krejcie and Morgan tables and ...

Journal:
  • CoRR

Volume: abs/1611.10162

Publication date: 2016